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A method of identifying a nucleic acid sequence in a biological sample comprises using a pair of universal oligonucleotide primers 
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Method for Identifying nucleic add sequences from different blologlca 
sources through amplification of the same 

5 

The present invention relates to improvements in and relating 
to the amplification and identification of nucleotide base sequences, and in 
particular but not exclusively to improvements in and relating to the 
10 amplification and identification of nucleotide base sequences using the 
polymerase chain reaction (PGR). 

The PCR is a useful technique for amplifying genetic 
sequences. One application of this is the amplification of target gene 
sequences in biological samples from for example environmental, food and 
15 medical sources etc. to allow identification of causative, pathogenic, 
spoilage or indicator organisms present in the sample. 

Conventionally, the PCR is carried out using primers chosen 
or produced to amplify a particular target gene sequence within a given 
organism. Consequently, currently the PCR can only be used to amplify a 
20 single predetermined gene sequence at any one time and hence to confirm 
the presence or absence of that target sequence and the corresponding 
organism. Thus, when the identity of one or more organisms in a sample is 
to be established, it is necessary to guess or assume what 
sequences/organisms may be present, identify and obtain specific primers 
25 that are operable to amplify that sequence or a sequence indicative of that 
organism, and then conduct the experiment. Such procedures are time 
consuming and expensive, and somewhat unreliable. 

It is an object of the present invention to obviate or mitigate 
one or more of the above disadvantag s. 
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Throughout this specification the terms "unidentified 
sequence" and "unidentified nucleotide sequence" are used to mean that 
the sequence has not yet been identified in the particular sample being 
analysed, not necessarily one which is wholly unknown. It may be that 
5 once an unidentified sequence has been amplified and characterised it is 
one with which scientists are familiar. 

In one aspect the invention provides a method of identifying a 
nucleic acid sequence in a biological sample, which method comprises 
using at least one pair of oligonucleotide primers to amplify the nucleic acid 
10 sequence, and characterising the amplification reaction products, 

characterised in that the or each pair of oligonucleotide primers is operable 
to enable the amplification of nucleic acid sequences from different 
biological sources. 

The amplification method is not material to the invention. 
15 Preferably amplification is effected by the polymerase chain reaction 
(PCR). 

The method by which amplification reaction products are 
characterised is also not material to the invention. Such characterisation 
may be effected by known techniques, e.g. sequencing, or by the use of 

20 single-strand conformation polymorphism (SSCP) analysis, or by the use of 
a labelled probe which binds to the amplification reaction products. One 
suitable system is that marketed by Perkin Elmer under the trademark 
TaqMan which is described in more detail below. 

The method of the invention is characterised by the use of at 

25 least one pair of oligonucleotide primers operable to enable the 

amplification of nucleic acid sequences from different biological sources. 
These so-called universal primers can be selected from regions of genes 
which are largely or completely conserved in many different biological 
organisms. Pairs of univ rsal primers may be found by routine search in 

30 various different genes. To the best of the inv ntors' knowledge, the use 



WO 96/15264 



PCT/GB95/02654 



of such pairs of universal primers in order to identify the biological source 
of a particular nucleic acid sequence has not previously been proposed. 

In another aspect the invention provides a kit for identifying a 
nucleic acid sequence in a biological sample by the method described 
5 which kit comprises reagents, including at least one pair of oligonucleotide 
primers operable to enable the amplification of nucleic acid sequences 
from different biological sources, for amplifying the nucleic acid sequence, 
and means for characterising the amplification reaction products. 

Studies have revealed that the proteins involved in the cold- 
10 shock response in prokaryotic organisms are remarkably conserved 

throughout different organisms. Such conservation also extends to related 
proteins in eukaryotic organisms, for example the Y-box proteins. Very 
highly conserved regions of the amino acid sequences of these proteins 
have been identified, and recognised by the inventors as potential sites for 
1 5 universal amplification primers. 

Preferably a pair of oligonucleotide primers includes a first 
oligonucleotide primer which is substantially complementary to a nucleic 
acid sequence that codes for the peptide sequence GXVKWFNXXKGFGFI 
(SEQ id NO: 1) where X is any amino acid, or for a fragment thereof that 
20 includes at least four identified amino acids. 

Herein, peptide sequences are shown in the N to C terminus 
direction. Oligonucleotides are shown in the 5 1 to 3' direction. Reference 
to an amino acid in a peptide sequence is to be understood as a reference 
to the residue of that amino acid. Reference to a nucleotide in an 
25 oligonucleotide sequence is to be understood as a reference to a residue 
of that nucleotide. An identified amino acid is one not indicated by the 
symbol X. 

Preferably the said first oligonucleotide primer has the 
s quence GGTANAGTAAAATGGTTNAACNC (SEQ ID NO: 2) where N is any 
30 nucleotide, or a fragment thereof containing at least twetv nucleotides. 
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Pr ferably a second oligonucleotide primer is substantially 
complementary to a nucleic acid sequence that codes for the peptide 
sequence GPXAXNVTXX (SEQ id no: 3) where X is any amino acid, or for a 
fragment thereof that includes at least four identified amino acids. 
5 Preferably the said second oligonucleotide primer has the sequence 
GGTTACGTTANCNGCTNNNGGNCC (SEQ ID NO: 4) where N is any 
nucleotide, or a fragment thereof containing at least twelve nucleotides. 

Preferably a third oligonucleotide primer used is substantially 
complementary to a nucleic acid sequence that codes for the peptide 
1 0 sequence DVFVHFSAIQ (SEQ ID NO: 5). 

Preferably the said third oligonucleotide primer used has the 
sequence GATGTATTCGTACATTTCTCTGCTATCC (SEQ ID NO 6) or its 
reverse complement GGATAGCAGAGAAATGTACGAATACATC (SEQ ID 
NO: 7) or a fragment of either containing at least twelve nucleotides or an 
1 5 oligonucleotide having at least 80% homology therewith. The third 

oligonucleotide primer or its reverse complement forms a pair with either 
the second or the first oligonucleotide primers, respectively. Alternatively, 
the first oligonucleotide primer forms a pair with the second oligonucleotide 
primer. 

20 In another aspect the invention provides as new products, a 

pair of oligonucleotide primers comprising the first and one of the other two 
oligonucleotides, and a pair of oligonucleotide primers comprising the 
second and third oligonucleotides, and a set of primers comprising all three 
oligonucleotides. 

25 Reference is directed to the accompanying drawings in which 

Figure 1 is a comparison of the aligned amino acid sequences of the major 
cold shock proteins of various bacteria and of a number of eukaryotic Y- 
box proteins. 

Figur 2 shows the sequences of three cons rv d regions of 
30 g nes which cod for these prot ins, and includ s three oligonucleotides 
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whose sequences or reverse complementary sequences are suitable for 
use as primers according to this invention. 

Figure 3 is a schematic representation of an oligonucleotide 
primer set in use in the amplification of a nucleic acid sequence according 
5 to this invention. 

Figures 4 and 5 show the front and back oligonucleotide 
primers from Figure 2, and how they relate to conserved peptide 
sequences of cold shock and Y-box proteins. 

Figure 1 shows alignments of amino acid sequences of 

10 published cold shock and Y-box proteins. (: = space in sequence; 

- = conserved amino acid). E.c = Escherichia coti, B.s « Bacillus subtilis, 
B.g = Bacillus globigii, B.gl = Bacillus globisporus, B.c = Bacillus 
caldolyticus, B.st = Bacillus stearothermophilus, S.c = Streptomyces 
caivuligers, H.s = Homo sapiens, R.n = Rattus norvegicus, X.I = Xenopus 

15 /aews, A t = Arabidopsis thaliana. The data clearly demonstrates that 
these proteins share large regions of homology. By identifying the most 
highly conserved regions within these alignments, oligonucleotide primers 
were designed which, when used in a PGR reaction using chromosomal 
DNA from various bacteria, facilitate the amplification of short DNA 

20 sequences (-180 bp) possessing regions of homology with published 
major cold shock protein sequences. 

Figure 2 focuses on three highly conserved regions of Figure 
1, and shows how the genes which code for these proteins are also highly 
conserved, both across various bacterial species and also in the frog. 

25 These conserved regions have been used to design three degenerate 
primers; 

- a first or front primer which hybridises to the active coding 
strand of the DNA; 

- a middl or third oligonucleotid which hybridis s to the 
30 coding or the non-coding strand of the DNA; and 
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- a back or second primer which hybrids s to th non-coding 
strand of the DNA. 

In these degenerate oligonucleotides, a capital letter 
designates a nucleotide that is conserved throughout; a small letter 
5 designates a nucleotide that is not universally conserved; and a dot 
indicates that the nucleotide shown below it can be changed at will. 

The three oligonucleotides shown in Figure 3 are indeed 
universal primers. Using them, the inventors have successfully amplified 
DNA sequences in biological samples derived from bacteria, eukaryotes 
10 including rat, frog and man, and plant tissue. For example, nucleic acid 
sequences have been amplified from the following micro-organisms:- 



Bacillus subtilis 



15 



Xanthomonas campestris 
Salmonella typhimurium 
Micrococus luteus 



Chromobacterium violaceum 



Agrobacterium lumefaciens 
Azospirillus 



20 



Rahnella aquaticus 
Pseudomonas putida 
Serratia liquifaciens 



Serratia marcescens 



Pseudomonas fluorescens 



25 



Staphylococcus aureus 
Pseudomonas aemginosa 
Pseudomonas mesoacidophilus 
Photobacterium leiognathi 



Vibrio fischeri 



Xenorhabdus luminescens 



30 



Listeria monocytogenes 
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Bacillus stearothermophilus 
Thermophilus aquaticus 
Yersinia enterocolitica 

Amplification was performed under conventional PCR 
5 conditions, using a touchdown technique to minimise interference from 
impurities. The following temperature profile was used:- 

a) 94°C for 6 minutes. 

b) 94°C for 30 seconds. 

60 - 51 °C for 30 seconds. 
10 72°C for 30 seconds. 

b) repeated 20 times with the annealing temperature reduced 
by one degree every second cycle. 

c) 94°C for 30 seconds 
50°C for 30 seconds 

15 72°C for 30 seconds 

c) cycle repeated 20 times. 

Figure 3 shows front and middle primers of Figure 2 
hybridised to the coding strand of a nucleic acid, and the reverse 
complement back primer from Figure 2 hybridised to the non-coding strand 

20 of the DNA. The front and back primers form a pair of oligonucleotide 
primers according to the invention. The middle and back primers form 
another pair of oligonucleotide primers. The three primers together form a 
set according to the invention. 

Figure 4 shows the highly conserved amino acid sequence of 

25 the cold shock proteins used to produce the front primer of the primer pair. 
Figure 5 shows the highly conserved peptide sequence of the cold shock 
proteins used to produce the back primer of the primer pair. Each X in the 
sequence indicates a position in the sequence at which the precise nature 
of the amino acid is not particularly important sine these positions are less 

30 conserved than the others. Consequently th nature of the primer at the 
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corresponding position can vary without d triment to the efficacy of the 
respective primer. 

Figure 4 also shows the front oligonucleotide primer of the 
universal primer pair t alongside the particularly conserved peptide 
5 sequence from which it is derived. It should be noted that the conservation 
of the peptide sequence is greater than that of the corresponding 
nucleotide sequence due to "wobble", i.e. amino acids can be coded for by 
more than one codon triplet. Thus, the primer is one of a number of 
possible primers that are hybridisable to the DNA sequence actually coding 

10 for the conserved sequence GXVKWFNXXKGFGFI (SEQ ID NO: 1). This 
primer is complementary to and hence operable to hybridise to a first 
primer binding region on the coding or sense strand of a cold shock protein 
gene sequence, as shown in Figure 3. 

Figure 5 shows the back oligonucleotide primer of the pair, 

1 5 alongside the highly conserved peptide sequence from which it is derived. 
The back primer is operable to hybridise with a second primer binding 
region on the noncoding or anti-sense strand of a cold shock protein gene 
sequence, again as shown in Figure 3. 

The primer binding regions are separated by a generally less 

20 conserved series of about 180 base pairs. This less well conserved part of 
the gene encodes for a cold-shock response protein, or a related or similar 
protein such as a Y-box protein (eukaryotes). The primer binding regions 
are sufficiently conserved to enable the universal primers to recognise and 
hybridise to the DNA of a large number, possibly all or almost all, 

25 organisms, including both prokaryotic and eukaryotic organisms and 
thereby enable amplification of the intervening sequences from these 
organisms as will be described. The primer pair is at least expected to 
operate with very large classes of organisms and thus to be "universal" in 
this sense. The gene howev r does have a degree of peculiarity in ach 

30 differ nt type of organism, such that characterisation of the amplification 
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products made with the primers and identification of the sequence, the 
RNA, protein and source organism can be made (see later). 

In use, the primers are operable in a generally conventional 
polymerase chain reaction to amplify a gene encoding for a cold shock or 
5 related protein or indeed any sequence bounded by the highly conserved 
primer binding regions. It is preferable that the reaction is conducted using 
a substantially DNA free thermophilic transcription enzyme to mitigate 
"background" amplification i.e. amplification of exogenous DNA associated 
with the enzyme. However, use of an improved polymerase chain reaction 

10 method in accordance with the present invention may render this 

requirement less crucial (see below). A suitable enzyme is replitherm. 

One particular application of the universal primer pair is the 
identification of unidentified source organisms in a biological sample. For 
example, food, environmental and medical samples can be analysed to 

1 5 identify the nature of a particular causative, pathogenic spoilage and/or 
indicator organism. 

Conventionally sequence specific and hence organism- 
specific primers have had to be chosen or produced to enable such 
identification procedures. One significant disadvantage of this, which the 

20 present invention obviates, is the need for the investigator to guess or 

assume what organism may be involved, and test for it. If that organism is 
not involved, then the investigator will have to guess again. This has to be 
repeated until the investigator has correctly guessed or assumed the 
identity of the organism involved and the test has proved this to be the 

25 case. It is feasible that the organism involved is not identified as a result of 
such laborious techniques. Moreover, it may be that more than one 
organism is involved, a fact that may be missed using conventional 
techniques, but which should not be missed using the present invention. 
Th pr s nt inv ntion allows an inv stigator simply to 

30 conduct the polym ras chain reaction using the universal primers to 
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amplify all sequences present in the sample bound by the relevant highly 
conserved primer binding regions. The polymerase reaction products (the 
amplified sequences) can then be characterised, for instance by 
sequencing them or using high resolution gel electrophoresis, to enable 
5 identification of the sequence amplified. It is preferable that techniques ar 
adopted which obviate the need to sequence the amplification products, 
since sequencing is a relatively laborious process. 

Therefore, in accordance with the present invention the gel 
pattern produced using electrophoretic techniques to separate the reaction 

10 products produced using the primers may be compared against a data 
bank of gel patterns produced from known sequences and organisms, 
such that identification of the sequence and hence source organism in the 
sample being analysed can be readily ascertained by matching it with one 
in the data bank. If there is no corresponding pattern on the data bank 

1 5 then the products may have to be sequenced, or identified by another 

appropriate method. It will be appreciated that determination of the identity 
of the sequence and source organism by such comparative procedures is 
highly efficient, particularly when the comparison is conducted by a 
computer. 

20 The use of fluorescent labels to label the PCR products 

facilitates the identification procedure. 

The front and back and middle primers may be labelled with 
fluorescent labels (Figure 3; e.g. FAM, TAMRA, JOE as shown in Figure 
2). Thus, when the reaction products are separated on a gel, the positions 

25 of the reaction products can be established, preferably by using apparatus 
operable to detect the labels. Such apparatus may employ laser 
technology to detect the labels in a conventional manner. 

Once the position of the reaction products have been 
established, they can be calibrated or standardised to take into account 

30 varying parameters such as the type of label used, temperature etc., and 
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then compared, by computer with known patterns. The differences in the 
less conserved intervening sequences will cause the reaction products of 
different sequences to move through the separating gel at different rates, 
as a result of differences in size, charge, etc. It is therefore possible to 
5 distinguish between different sequences, to an accuracy of one base 
difference, as in the fluorescence-based PCR single strand conformation 
polymorphism analysis (PCR-SSCP) [Ellison et a/, BioTechniques, Vol 15 
No 4 (1993) p684; and Ellison et a/, BioTechniques, Vol 17 No 4, 1994, p 
742], and hence to distinguish and identify the sequence and source 
10 organism in this way. 

If the procedure results in a pattern which does not compare 
with a known pattern, the investigator is still alerted to the presence of an 
organism, which can then be further investigated by other procedures. 

A preferred method of characterising the amplification 
15 reaction products involves use of the system marketed by Perkin Elmer 
under the trademark TaqMan. This is a PCR assay system that uses the 
hydrolysis of a fluorogenic probe to monitor the extent of amplification. 
The probe consists of an oligonucleotide labelled with both a fluorescent 
reporter and a quencher dye. During the PCR process, this probe is 
20 cleaved by the 5' nuclease activity of taq DNA polymerase if and only if it 
hybridises to the segment being amplified. Cleavage of the probe 
generates an increase in the fluorescence intensity of the reporter dye. 
Thus, amplification of a specific product can be detected by simply 
measuring fluorescence after PCR. A separate probe is required for each 
25 bacterial or other species to be investigated. But once synthesised, the 
probes are very quick and easy to use. 

As previously mentioned, an improved method of amplifying 
and identifying a nucleotide sequence comprises using a primer set of at 
least three cooperable primers to amplify the sequence and at least one 
30 portion thereof. The primers of the primer s t are preferably the thre 
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universal primers as described above. However, it is to be appreciated 
that the improved method is operable with non-universal primers, for 
example to increase the fidelity of the PCR-SSCP analysis mentioned 
above. 

5 The improved method likewise increases the specificity and 

fidelity of the characterisation and identification methods using the 
universal primer set described above. 

Generally conventional PCR methodology is employed to 
simultaneously amplify different parts of the sequence using these primers. 

10 Two principal reaction products are produced, one between the front and 
back primers, and one between either the middle and back primers or the 
front and middle primers depending on which of the middle primers is used. 

Thus two different distinguishable sets of reaction products 
are produced for each sequence amplified. Therefore, the gel pattern 

1 5 produced by separation of these is likely to be more specific and therefore 
more distinguishable for a given sequence and hence organism than a gel 
pattern of reaction products produced using only one primer pair. 

Each of the reaction products can have a different label to 
enable them to be identified on the gel. 

20 It is to be appreciated that use of three primers, or more 

would be in accordance with the present invention. Indeed the more 
primers that were used, the greater the fidelity and specificity of the 
technique. The use of four primers is envisaged, where front and back 
primers plus two middle primers as described above are used. In this 

25 case, the two middle primers will be of opposite sense but each will 

correspond to a different part of the conserved region to avoid formation of 
primer dimers. 

Using the present invention, the investigator can readily 
resolve the general question, "What is present?", rather than ask the 

30 specific qu stion "Is organism X pr s nt? w . 
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The invention therefore provides a considerably simpler and 
more efficient means to identify nucleotide sequences and source 
organisms in biological samples. 

It is to be appreciated that proteins other than the cold shock 
5 proteins (and related proteins) may comprise suitable highly conserved 
regions appropriate for producing universal primers, and that the present 
invention extends to primer pairs, sets and methods suitable for use in 
relation thereto. For example, ribosomal proteins, sigma subunits and 
RsmA comprise regions that appear highly conserved throughout a range 
10 of sources, and therefore would appear to provide appropriate sites for 
suitable universal primers. 

Moreover, universal primers may be used in reactions other 

than PCR. 
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CLAIMS 

5 

1. A method of identifying a nucleic acid sequence in a 
biological sample, which method comprises using at least one pair of 
oligonucleotides primers to amplify the nucleic acid sequence, and 

10 characterising the amplification reaction products, characterised in that the 
or each pair of oligonucleotide primers is operable to enable the 
amplification of nucleic acid sequences from different biological sources. 

2. A method as claimed in claim 1 , wherein the amplification is 
performed by the polymerase chain reaction. 

15 3. A method as claimed in claim 1 or claim 2, wherein the 

characterising is performed by sequencing or by single-strand 
conformation polymorphism analysis or by the use of a labelled probe 
which binds to the amplification reaction products. 

4. A method as claimed in any one of claims 1 to 3 t wherein th 
20 oligonucleotide primers used hybridise to a gene which codes for a cold 

shock protein or a eukaryotic Y-box protein. 

5. A method as claimed in any one claims 1 to 4, wherein a first 
oligonucleotide primer used is substantially complementary to a nucleic 
acid sequence that codes for the peptide sequence GXVKWFNXXKGFGFI 

25 (SEQ ID NO: 1) where X is any amino acid, or for a fragment thereof that 
includes at least four identified amino acids. 

6. A method as claimed in claim 5, wherein another 
oligonucleotide primer used is substantially complementary to a nucleic 
acids quence that codes for the p ptidesequ nee GPXAXNVTXX 

30 (SEQ ID NO: 3) wher 
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X is any amino acid, or for a fragment th reof that includes at I ast four 
identified amino acids. 

7 - A method as claimed in claim 5, wherein the oligonucleotide 

primer used has the sequence GGTANAGTAAAATGGTTNAACNC 
5 (SEQ ID NO: 2) where N is any nucleotide, or a fragment thereof containing at 
least twelve nucleotides. 

8. A method as claimed in claim 6, wherein the oligonucleotide 

primer used has the sequence GGTTACGTTANCNGCTNNNGGNCC 
(SEQ id NO: 4) where N is any nucleotide, or a fragment thereof containing at 
1 0 least twelve nucleotides. 

9 - A method as claimed in any one of claims 5 to 8, wherein 

another oligonucleotide primer used has the sequence 
GATGTATTCGTACATTTCTCTGCTATCC (SEQ ID NO: 6) or its reverse 
complement GGATAGCAGAGAAATGTACGAATACATC (SEQ id NO: 7) or a 
1 5 fragment of either containing at least twelve nucleotides 

10. A pair of oligonucleotide primers, operable to enable the 

amplification of nucleic acid sequences from different biological sources, 
comprising a first oligonucleotide primer substantially complementary to a 
nucleic acid sequence that codes for the peptide sequence 
20 GXVKWFNXXKGFGFI (SEQ ID NO: 1) where X is any amino acid, or for a 
fragment thereof that includes at least four identified amino acids, and 
another oligonucleotide primer substantially complementary to a nucleic 
acid sequence that codes for the peptide sequence GPXAXNVTXX 
(SEQ ID NO: 3)where X is any amino acid, or for a fragment thereof that 
25 includes at least four identified amino acids. 

11. A set of oligonucleotide primers, operable to enable the 

amplification of nucleic acid sequences from different biological sources, 
comprising a first oligonucleotide primer having the sequence 
GGTANAGTAAAATGGTTNAACNC (SEQ ID NO: 2) where N is any 
30 nucl otid , or a fragment thereof containing at least twelve nucl otides, and at 
least one 
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other oligonucleotid primer having a sequence selected from (i) 
GGTTACGTTANCNGCTNNNGGNCC (SEQ ID NO: 4) where N is any 
nucleotide, or a fragment thereof containing at least twelve nucleotides and 
(ii) GATGTATTCGTACATTTCTCTGCTATCC (SEQ ID NO: 6) or its reverse 
5 complement GGATAGCAGAGAAATGTACGAATACATC (SEQ ID NO: 7) or a 
fragment of either containing at least twelve nucleotides 

12. A kit for identifying a nucleic acid sequence in a biological 
sample by the method of any one of claims 1 to 9, which kit comprises 
reagents, including at least one pair of oligonucleotide primers operable to 

1 0 enable the amplification of nucleic acid sequences from different biological 
sources, for amplifying the nucleic acid sequence, and means for 
characterising the amplification reaction products. 

13. A kit as claimed in claim 12, wherein the at least one pair of 
oligonucleotide primers is a pair or set of oligonucleotide primers as 

1 5 claimed in claim 1 0 or claim 1 1 . 
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MSGKMTG I VKWFNADKGFGFI TPDDGSKDVFVH FSAI QN 

--N L S-V 

-:A-IK-Q ES A G 

-: : : EK-T NA C-EG-GE-I-A-Y-T — M 

: LE-K SE EV:E-QD G 

-EV: E-QD G 

-EV:E-QD G 

-EVEG — : T G 

-EVEG — : T---G 

-AQ-G-GP Y NA 

-NRN-TKE QT--KK 



SE--L 

I SE 

:QR-K NE--Y 

:QR-K NE--Y 

: A--T E 

IAT-VL-T VRN-Y 

IAT-VL-T VRN-Y NRN-TKE CT--KK 

IAT-VL-T VRN-Y NRN-TKE QT--KK 

GGERRK-S DTQ GD-L Q-S-RS 



E.c 

E.c 

E.c 

E.c 

B. s 

B.g 

B.gl 

B.c 

B.st 

S.c 

H.s 

R. n 

X.l 

A. t 



CspA 

CspB 
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: : : DGYKSLDEGCjKVSFTI : ESGAKGPAAGNVTSL 

: : : -N-RT-F T-S- : A — IITD 

: :N-F-T-A N-E-E- : QD-Q V AI 

: : RT-KA--S-Q-DV:HQ-P--NH-SVIVPVEVEVAAVA 

: :E-F-T-E A E- : VE-NR — Q-A KEA 

: :E-F-T-ES~A E- : VE-NR — Q-A 

: :E-FCT-E A E- : VE-NR- -Q- A 

: :E-F-T-E — EA EQ : VE-NR — Q-AV-L 

: : E-FST-E — EA EQ: VE-NR— Q-AV-L 

: : T-FR--E-N-V-N-DV: Tfl-E : — Q-E--SPA 

NNPRKYLR-VGD-ET-E-DV : VE-E--EE-A GPG 

NNPRKYLR-VGD-ET-E-DV: VE-E--AE-A GPG 

NNPRKYLR-VGD-ET-E-DV :VE-E--AE -A GPG 

: : E-FR--AAEE — E-EVEIDN-NR-K-ID-SGPD 
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Fig.2. 

PRONT B. C CSpA GGTATCGTAAAATGGTTCAACGC 20/23 (SEQ ID NO: 8) 

B. C CSpB GGTTTAGTAAAATGGTTTAACGC 21/23 (SEQ 10 NO: 9) 

B. C CSpC GGTCAGGTTAAGTGGTTCAACGA 17/23 (SEQ ID NO: 10) 

S. t C3pS GGTATCGTAAAATGGTTCAACGC 2 0/23 (SEQ ID NO: 11) 

S. t CSpB GGTTTAGTAAAATGGTTTAACCC 2 0/23 (SEQ ID NO: 12) 

B. S CSpB GGTAAAGTAAAATGGTTCAACTC 21/23 (SEQ ID NO: 13) 

L. m CBpL GGTACAGTAAAATGGTTTAACGC 22/23 (SEQ ID NO: 14) 

P. m cepP GGAAAAGTAAAATGGTTTAACGC 22/23 ("0 ID NO: 15) 

PRG Yl GGGACAGTCAAATGGTTTAATGT 18/23 (SEQ ID NO: 16) 



• • • 

UNI GGtaaaGTaAAaTGGTTtAAcgc PAM (SE0 ID NO: 17) 



MUDDLE B. C CSpA GATGTATTCGTTCATTTCTCTGCTATTC 26/26 (SEQ 10 NO: 18) 
B. C CBpB GATGTGTTTGTGCATTTTTCTGCGATTC 22/26 (SEQ 10 NO: 19) 
B. C CSpC GATGTGTTCGTACACTTCTCCGCTATCC 25/2E (SEQ ID NO: 20) 
S. t CSpS GATGTGTTCGTACACTTCTCTGCTATTC 25/26 (SE0 ID NO: 21) 
S. t CSpB GATGTGTTCGTACACTTCTCTGCTATCC 26/26 (SEQ ID NO: 22) 
B. B CSpB GATGTATTCGTTCATTTCTCTGCTATTC 26/26 (SEQ 10 NO: 23) 
L. m CSpL GATGTATTCGTACATTTCAGCGCTATCC 25/26 (SEQ ID NO: 2<t) 
p. m CBpP GATGTATTCGTTCACTTCTCAGCTACTC 23/28 (SEO 10 NO: 25) 
PRG Yl GATGTGTTTGTACACCAAACTGCCATCA 19/28 (SEQ 10 NO: 26) 



UNI GATGTaTTcGTaCAtttCtCtGCtAtCC TAKRA (SEQ ID NO: 27) 



BACK B. C cspA GGCCCGGCAGCTGGTAACGTAACC 19/24 (SEQ ID NO: 28) 

B. C CSpB GGTCCTGCAGCAGCAAATGTCATC 18/24 (SEQ ID NO: 29) 

B. C CSpC GGTCCGGCAGCTGTTAACGTAACA 18/24 (SEQ ID NO: 30) 

5. t CSpS GGCCCGGCAGCTGGTAACGTAACC 19/24 (SEQ ID NO: 3D 

B. B CSpB GGACCACAAGCTGCTAACGTTACT 19/24 ( S E0 ID NO: 32) 

L. m CBpL GGACCTCAAGCAGCTAACGTTCAA 19/24 (SEQ 10 NO: 33) 

PRG Yl GGTGCAGAGGCAGCTAATGTAACT 18/24 (SEQ 10 NO: 3 < 0 



• • • • • • 

UNI GGacCtgaaGCaGc tAAcGTAacc JOE ( S eq id NO: 35) 
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